Monitoring sentiment in open source mailing lists: exploratory study on the apache ecosystem

نویسندگان

  • Parastou Tourani
  • Yujuan Jiang
  • Bram Adams
چکیده

Large software projects, both open and closed source, are constructed and maintained collaboratively by teams of developers and testers, who are typically geographically dispersed. This dispersion creates a distance between team members, hiding feelings of distress or (un)happiness from their manager, which prevents him or her from using remediation techniques for those feelings. This paper evaluates the usage of automatic sentiment analysis to identify distress or happiness in a development team. Since mailing lists are one of the most popular media for discussion in distributed software projects, we extracted sentiment values of the user and developer mailing lists of two of the most successful and mature projects of the Apache software foundation. The results show that (1) user and developer mailing lists carry both positive and negative sentiment and have a slightly different focus, while (2) work is needed to customize automatic sentiment analysis techniques to the domain of software engineering, since they lack precision when facing technical terms Keywords—Empirical Software Engineering, Sentiment Analysis, Mining Software Repositories, Mailing List Data

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Central Role of Mailing Lists in Open Source Projects: An Exploratory Study

Mailing lists provide a rich set of data that can be used to improve and enhance our understanding of software processes and practices. This information allows us to study development characteristics like team structure, activity, and social interaction. In this paper, we perform an exploratory study on the GNOME project and recover operational knowledge from mailing list discussions. Our findi...

متن کامل

Analysis of Coordination Between Developers and Users in the Apache Community

Coordination is one of the keys for the success of open source software (OSS) communities because geographically distributed members need to collaborate on their work using communication tools (e.g., mailing lists, bulletin board systems, bug tracking systems, and so on). In this paper, we investigated the informal social structure among developers and users by analyzing two mailing lists of de...

متن کامل

The Nagios Community: An Extended Quantitative Analysis

The health of an Open Source ecosystem is an important decision factor when considering the adoption of an Open Source software or when monitoring a seeded Open Source project. In this paper we assess the ecosystem health using approaches involving domain analysis and social network analysis of mailing lists for the Nagios project. We elaborate approaches for how involvement of different roles ...

متن کامل

Predicting Email Response using Mined Data

Mailing lists are the primary medium of communication in open source projects. For some projects the sheer volume of emails on the mailing lists becomes unmanageable and messages may begin to be ignored. This can have a number of negative effects on an open source project. We present a way to predict who is most likely to respond to an email, thus providing the potential of giving mailing list ...

متن کامل

Making and Sharing Knowledge at Electronic Crossroads: the Evolutionary Ecology of Open Source

Based on the analysis of developer mailing lists of two large-scale open source projects, we argue that, in open source development, processes of knowledge making and sharing exploit the structuring properties of high density, massive interaction for evolutionary purposes. The mailing lists reveal patterns of activity and resource distribution that exhibit ecological features. A high number of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014